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Abstract 



(vq We study how the round-off error changes the statistical properties of a Gaussian long memory 

process. We show that the autocovariance and the spectral density of the discretized process (i.e. 

[-h the process with round-off error) are asymptotically rescaled by a factor smaller than one, and 

C/} we compute exactly this scaling factor. Consequently, we find that the discretized process is also 

long memory with the same Hurst exponent as the original process. We consider the properties of 
two estimators of the Hurst exponent, namely the log-periodogram regression and the Detrended 
Fluctuation Analysis (DFA). By using numerical simulations and analytical considerations we 
show that the estimators of the Hurst exponent of the discretized process are severely negatively 
biased. We compute the asymptotic properties of the DFA for a generic long memory process and 

\q we apply the result to discretized processes. 

Keywords: Long-memory processes, Round-off error, Measurement error, Log-periodogram 
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1. Introduction 



"All economic data is discrete" (Engle and Russell (2004)). Round-off errors occur whenever 
a real valued process is observed on a grid of discrete values. A special case of round-off error is 
obtained by taking the sign of a real valued process. Round-off errors change the properties of the 
original stochastic process. Few exact results exist on the effect of round-off error on stochastic 
processes. Delattre and Jacod (1997), for example, proved a central limit theorem on a Brownian 
motion with round-off error sampled at discrete times. In this paper we study analytically and 
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numerically the effect of round-off error on long memory processes. We will use the term dis- 
cretized process to refer to the process with round-off error and the term discretization to refer to 
the rounding procedure. 

Round-off errors can be either due to a limit in the resolution of the observing device or to 
the fact that an underlying real valued process can manifest itself only as a discrete valued pro- 
cess. When seen as a resolution effect, round-off error can also be considered as a special case 
of measurement error. One recent strand of econometric literature considers the problem of esti- 
mation of a process that can be observed only with some noise due to the measurement process. 
The typical modeling approach is to consider the measurement error as an additive white noise 
process uncorrelated with the unobserved latent process (see, for example, Hansen and Lunde 
(2010)). Round-off error can be considered as a different form of measurement error, which is a 
deterministic function of the underlying latent process and it is not uncorrelated with it. 

An important example of round-off error due to the fact that the process manifests itself only as 
a discrete valued process is the dynamics of asset prices. Despite the fact that the price of an asset is 
a real number, transaction prices (but also quote prices) can assume only values which are multiple 
of a minimum value called tick size. For transaction by transaction data the tick over price ratio 
can be large leading to a price dynamics that cannot be even approximated as a real valued process. 
In the market micro structure literature the round-off error is one of the main sources of disturbance 
in the estimation of integrated volatility. Several papers have considered how tick size affects the 
diffusion dynamics of price on short time scales. For example, Gottlieb and Kalay (1985) and 
Harris (1990) developed microstructure models to investigate the effect of price discretization on 
return variance and serial correlation. More recent papers consider the effect of price discretization 
on phase portrait of returns (Szpiro (1998)), on price-dividend relation (Bali and Hite (1998)), and 
on integrated volatility (Rosenbaum (2009)). See also La Spada et al. (2011) and references 
therein for a recent review of the effect of tick size on the diffusion properties of financial asset 
prices. Beside market microstructure, discretized processes emerge naturally in discrete choice 
models, binned data, computer vision, detectors, and digital signal processing. Recently, the effect 
of discretization has also been studied in the framework of rational inattention (Saint- Paul (201 1)). 

Long memory processes are ubiquitous in natural, social, and economic systems (see Beran 
(1994)). However, a large part of the theory, statistics, and modeling of long memory processes 
is based on the assumption of normality of the distribution and very often also on the hypothesis 
of linearity of the generating mechanism. A major step toward generalization has been the pa- 
per by Dittmann and Granger (2002) that investigated the properties of a process obtained from 
a non-linear transformation of a Gaussian long memory process. Their main result is that when 
the transformation can be written as & finite linear combination of Hermite polynomials, the trans- 
formed process has the same or a smaller Hurst exponent of the original process, depending on 
the Hermite rank of the transformation (see below for a more precise definition of these terms). 
However, their results "need not hold for transformations with infinite Hermite expansion" (see 
Dittmann and Granger (2002)). Another problem is that, to the best of our knowledge, little is 
known about the properties of common estimators of the Hurst exponent when they are applied 
to non-Gaussian and non-linear time series. Note that even Dittmann and Granger (2002) did not 
discuss the issues related to the estimation of the Hurst exponent of non-linear transformations 
of Gaussian time series. The present paper contributes in this direction by presenting several re- 

2 



suits for a specific, yet important, class of non-linear transformations of a Gaussian long memory 
process, namely the class of discretization transformations. 

In this paper we present an in-depth analysis of the properties of a stochastic process obtained 
from the discretization of a large class of Gaussian long memory processes. We give the asymptotic 
behavior of the autocovariance and of the spectral density, by computing explicitly the leading term 
and the order of the second term in a series expansion. We find that the autocovariance and the 
autocorrelation are asymptotically rescaled by a factor smaller than one, and the Hurst exponent 
is the same for the continuous and the discretized process. The spectral density is also rescaled 
for small frequencies by the same scaling factor as the autocovariance. We find an explicit closed 
form of this scaling factor. It is worth noting that the decrease of the autocorrelation function holds 
for the discretization of any Gaussian weakly stationary process, either long-memory or short- 
memory. Our results are consistent with those in Hansen and Lunde (2010) on the estimation of 
the persistence and the autocorrelation function of a time series measured with error. 

We then consider two classic methods to estimate the Hurst exponent, namely the log-periodogram 
regression introduced by Geweke and Porter-Hudak (1983) and the Detrended Fluctuation Anal- 
ysis (DFA) introduced by Peng et al. (1994). The log-periodogram regression is a very popular 
semi-parametric method for the investigation of long memory properties of generic processes. 
It is frequently used in empirical work in economics and financial econometrics. For the log- 
periodogram regression we show that most of the results in the literature on the asymptotic consis- 
tency and normality of the estimator (see Hurvich and Beltrao (1993), Robinson (1995), Hurvich 
et al. (1998), Velasco (2000)) do not hold for the discretized process. We study the bias of the log- 
periodogram estimator due to the high-frequency component of the spectral density. We show that 
this bias is much more significant for the discretized process than for standard continuous pro- 
cesses such as the fractional Gaussian noise (fGn) and the fractional AutoRegressive Integrated 
Moving Average (fARIMA) process. By comparing the periodogram of the discretized process to 
that of the continuous process we argue that this bias is negative. 

The DFA (Peng et al. (1994)) is another very popular semi-parametric method for the investi- 
gation of long memory properties of generic processes. It was introduced more than fifteen years 
ago to investigate physiological data, in particular the heartbeat signal. Since its introduction it 
has been applied to a large variety of systems, including physical, biological, economic, and tech- 
nological data. In a recent paper Bardet and Kammoun (2008) computed explicitly the asymptotic 
properties of the DFA for the fGn and for a general class of Gaussian weakly stationary long mem- 
ory processes. Here we generalize their results to a non-Gaussian generic long memory process 
and we applied them to the discretized process. As a byproduct we show that the order of the error 
of the root mean square fluctuation given by Theorem 4.2 of Bardet and Kammoun (2008) is not 
correct for a generic Gaussian process. Because of the cancellation of a term the theorem is correct 
for the fGn and the fARIMA process as claimed in Bardet and Kammoun (2008). By comparing 
the root mean square fluctuation of the discretized process to that of the continuous process we 
argue that the second-order term induces a negative bias in the estimation of the Hurst exponent. 

The paper is organized as follows. In Section [2] we define the class of long memory processes 
we consider in the present paper. In Section|3]we derive some analytical results on the distribution, 
autocovariance, and spectral density of the discretized process. Section [4] presents analytical and 
numerical results on the estimation of the Hurst exponent obtained by using the periodogram and 
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the DFA. Finally, Section [5] concludes. 

2. Long memory processes 

In this paper we are interested in studying the effect of round-off error on a long memory pro- 
cess. There are several possible definitions of a long memory process. The very general definition 
we are using in the present paper is the following. 

Definition 1. A discrete time weakly stationary stochastic process {jc(/)}ieN is l° n g memory if its 
autocovariance function y(k) behaves as 

y{k) = k~ a L(k) for k>\, (1) 

where a 6 (0, 1) and L{k) is a slowly varying function at infinity^ 

Under this definition of long-memory processes the autocovariance function does not neces- 
sarily decay as a pure power-law. Consider the case L(k) = log k, or any power of the logarithm 
function. The autocorrelation function of a long memory process is not integrable in k between 
and +oo and, as a consequence, the process does not have a typical time scale. 

Long memory processes can be characterized by the exponent a describing the asymptotic 
behavior of the autocorrelation function or equivalently in terms of the Hurst exponent H that, 
for long memory processes, is H = 1 - a/2 e (0.5, 1). Long memory processes can also be 
characterized in terms of the fractional differencing parameter d = (1 - a)/2 e (0,0.5). In this 
paper we will use interchangeably a, H, and d for characterizing the long memory properties of 
the process. 

The class of long memory processes defined above is quite large due to the arbitrariness in 
the choice of the slowly varying function L(k). Some of the results we will present below hold 
for a more restricted class of long-memory processes characterized by the properties of the slowly 
varying function in Definition [T] 

Following Embrechts et al. (1997) we denote with "R the set of slowly varying functions at 
infinity. We introduce the following definition 

Definition 2. We define the set of well behaved slowly varying function as 

£(/, buPd = J I € 7?o : m = J] b ' k ' P ' for lar § e k \ & 

with I < oo, b > 0, j3 = 0, and 0i < fi i+ i V i = 0, 1, . . . ,7. Moreover the parameters b { and/Sj are 
such that 2^=0 bjk~^ converges absolutely for large k. 



l L(x) is a slowly varying function if ]ha x ^ llo L(tx)/L(x) = 1, W > (see Embrechts et al. (1997)). In the 
definitions above, and for the purposes of this paper, we are considering only positively correlated long-memory 
processes. Negatively correlated long-memory processes also exist, but the long-memory processes we will consider 
in the rest of the paper are all positively correlated. 



Note that all the slowly varying functions which are analytic at infinity are well behaved with 
/ = oo. 

We are interested here in the discretization of long memory processes characterized by a Gaus- 
sian distribution. The Gaussian time series is completely characterized by the mean (hereafter 
assumed to be zero), the diffusion coefficient D and the autocorrelation function p(k) = y(k)/D. 
Two classes of Gaussian long memory processes are often considered in the literature. The first 
one is the fractional Gaussian noise (fGn) (see Mandelbrot and van Ness (1968)) characterized by 
the autocorrelation function 

P(k) = \[(k+ l) m + (k- \) 2H - 2k 2H ] (3) 

For large k the asymptotic expansion of this expression is p(k) ~ H f 2 H iH l) (l + (2g ~ 2 ^ 2//-3) A. + . . . j. 
The second important example is the fARIMA(0, d, 0) process whose autocorrelationjis 

_ 1X3/2 -Zg)r(Z/ + *- 1/2) 

yK J Y(H-l/2)Y(k + 3/2-H) w 

where T(-) is the gamma function. For large k the asymptotic expansion of this expression is 

(1A TQ/2-H) 1 (, (4H 3 -12H 2 + llH-3) 1 \ 

AW ~ T(H-l/2)k 2 - 2H V 12 k 2 "•" • • •)• 

Note that both for the fGn and for the fARIMA processes the slowly varying function L(k) is 
analytic at infinity and therefore it is well behaved according to Definition [2} In the following we 
will present results for the discretization of generic Gaussian long memory processes, and we will 
consider the fGn or the fARIMA as special cases. 

3. The discretized process 

Given a discrete-time and real valued process {jc(/)}, £ n, the discretization procedure consists 
in setting a grid of points jS with j e Z and the value of the discretized variable at time i is 
Xd(i) = [x(i)/8]6, where [z] is the integer part of z. The parameter 6 sets the level of round-off 
error. This type of discretization appears, for example, whenever a weakly stationary process is 
discretized through a binning procedure. 

The probability mass function of the discretized process is 

oo 

Pd(x) = 2_j 9nS(x - nS) (5) 

«=-oo 

where 

q n = I p(x)dx (6) 

J S(n- 1/2) 

and 5{x) is the Dirac delta function. 



In the following we consider only fARIMA(0, d, 0). Therefore, when in the following we refer to fARIMA, we 



mean fARIMA(0, d, 0). For results on more general fARIMA see Section 4.3 
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It is useful to introduce the adimensional scaling variable 



£ = 



D 



(7) 



Since x is Gaussian distributed, the variance D d of the discretized process can be calculated explic- 
itly. For detailed analytical results on the distributional properties of the discretized process see 
Appendix A Left panel of Figure [T] shows the ratio Dj/D as a function of the scaling parameter 
£. It is worth noting that this ratio is not monotonic. For small t, the ratio goes to zero because 
5 is very large and essentially all the probability mass falls in the bin centered at zero. Finally, 
the parameter f sets the fraction q of points which in the discretized process have value zero. 
It is direct to show that q = erf l/2-^2f , where erf[-] is the error function. This is clearly a 
monotonically decreasing function. In the numerical examples below we will use £ = 0.1, 0.25, 
and 0.5 corresponding to q = 0.886, 0.683, and 0.521, respectively. 

A different type of discretization that we will consider below is obtained by taking the sign of 
the variable x. Assuming that the distribution function of x is absolutely continuous with respect to 
the Lebesgue measure in a neighborhood of 0, so that the event x(i) = has zero probability, this 
discretization leads to Xd{i) = sign(x(i)) = ±1 with probability 1. Trivially, when the discretization 
is obtained by taking the sign function the variance of the discretized process is D s = 1 . 





Figure 1: Left panel. Ratio between the variance Dj of the discretized process and the variance D of the original 
Gaussian process as a function of the scaling parameter g. The inset shows the fractional change Dj/D- 1 as a function 
of £ in a log log scale. The dashed line shows the f behavior. Right panel. Ratio between the squared coefficient 



g\ (see formula \\2\) and the variance of the discretized process as a function of the scaling parameter (, This ratio is 



equal to the ratio between the autocorrelation function pd(k) of the discretized process and the autocorrelation function 
p(k) for large values of lags k (see Corollaryfl]). 



3.1. Autocovariance and autocorrelation function 

In order to calculate how the time correlation properties change when a weakly stationary 
Gaussian process is discretized we will make use of a general theory presented, for example, in 



Beran (1994) and Dittmann and Granger (2002). For the benefit of the reader we recapitulate here 
this approach. 

We consider first the case of an underlying weakly stationary Gaussian process of unit vari- 
ance. In order to extend the theory to non-unit variance we need simply to do the transformation 
x — » xj VZ). The starting point is a series expansion of the bivariate Gaussian density function in 
Hermite polynomials. Hermite polynomials are an orthonormal polynomial system with Gaussian 
weight. Specifically, 

-x 2 /2 



Cx 



e 
H n (x)H m (x)——dx = 6 nm (8) 

V2^r 

where 8 nm is the Kronecker delta. The expansion of the bivariate Gaussian density function P(x,y) 
in Hermite polynomials is (see Barrett and Lampard (1955)) 



P(x,y) = P(x)P(y) 



+ Y J p j Hj(x)H j (y) 



(9) 



where P(x) is the univariate Gaussian density function andp is the correlation coefficient between 
variables x and y. 

Following Lemma 1 of Dittmann and Granger (2002), if we transform two Gaussian random 
variables x and y with a nonlinear transformation g(x) that can be decomposed in Hermite polyno- 
mials 

oo 

g(x) = g Q + Y J g j H j (x) (10) 

7=1 

the linear covariance and the linear correlation of the transformed variables are 

oo v^ oo 2 / 

Cov[g(x)g(y)] = J] g 2 jP J Cor[g(x)g(y)] = -£^ (11) 

7=1 -^7=1 8j 



The proof is straightforward. The coefficients gj in ( 10) are 

? -x 2 /2 



Xoo - x -ji 

g(x)Hj(x)^dx (12) 

oo V27T 



The smallest j > such that gj is non- vanishing is called the Hermite rank of the function g(x). 



Note that the second equation in (11) implies that any non-linear transformation of a bivariate 
Gaussian distribution decreases the correlation between the variables (the covariance can of course 
increase or decrease). When x and y describes the same process at two different times the above 
equations can be used to compute the autocovariance and autocorrelation properties of the trans- 
formed process. 

Dittmann and Granger (2002) used the above expansion to study how the long memory prop- 
erties change as a result of a nonlinear transformation that can be written as a finite Hermite 
expansion. (See Proposition 1 of Dittmann and Granger (2002).) They mention that this approach 
cannot be used if the the transformation has an infinite Hermite expansion. As we will show be- 
low, discretization can be expressed as an infinite sum of Hermite polynomials and therefore we 
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cannot use directly Proposition 1 of Dittmann and Granger (2002). Therefore in the following we 
compute explicitly the asymptotic behavior of the autocovariance function and the small frequency 
series expansion of the spectral density and we directly obtain the long memory properties of the 
discretized process. 

3.1.1. Discretized process 

By using the theory outlined above we compute the asymptotic behavior of the autocovariance 
jd(k) and the autocorrelation pd(k) of the discretized process. 

Proposition 1. Let WOLeN be a discrete time Gaussian weakly stationary process with auto- 
covariance function given by Definition [7J Then the autocovariance function of the discretized 
process [xd(.i)}ien satisfies 



7d(k) 



'# 2 (0,<r 1/2 0' 2 



where & a (u, q) is the elliptic theta function. 



k~ a L(k) (l + O (k- 2a L 2 (k))) as k 



oo . 



(13) 



The proof of this and of the other propositions in this section are in Appendix B According 
to Definition[TJ this proposition proves that the discretized process is a long memory process with 
the same Hurst exponent as the original process. Note that asymptotically the autocovariance of 
the discretized process is proportional to the autocovariance of the original process. 

Moreover, as for the original process, the autocovariance function does not necessarily decay as 
a pure power-law. Note also that the second order corrections are nested inside the slowly varying 
function, and it may happen that the first order term in 0(k~ 2a L 2 (k)) dominates the second order 
term in k~ a L(k). This might be the case, for example, if L(k) converges to a constant. To compute 
explicitly the second-order corrections we need to specify a functional form for the slowly varying 
function L(k). To this end we consider long memory processes whose slowly varying function in 
the autocovariance is well behaved according to Definition [2} We prove the following: 

Proposition 2. Let WOKeN be a discrete time Gaussian weakly stationary process with autoco- 
variance function given by Definition^and L e £,(I,b(,fii). Then the autocovariance function of 
the discretized process {Xrf(0}jeN satisfies 



7d(k) 



MO, e-^)) 



where d- a (u, q) is the elliptic theta function. 



b k- a (l + o(k- min(2a ^ ) )) ask 



oo. 



(14) 



The behavior of the autocorrelation function is trivially obtained from the two above proposi- 
tions. For example, for the more general case we have the following. 

Corollary 1. Under the conditions in Proposition [7J the autocorrelation function Pd(k) of the 
discretized process satisfies as k 



oo 



Pd(k) 



fl 2 (0,e- 1/2 ^ 



k~ a L(k) 
D, 



(l + 0(k- 2a L(k) 2 )) 



g\ L(k) 
D H k 2 ~ 2H 



(l+0(k 4H - 4 L(k) 2 )) (15) 



For large k, g\/D d is the ratio of the autocorrelation function of the discretized process and the 
underlying process. The right panel of Figure [II shows g\/D d as a function of £. We observe that 
the more severe is the discretization, the larger is the reduction of the autocorrelation function. 

We numerically tested our propositions and the error made by considering only the leading 
term in the asymptotic expansion. We simulated a fGn with unit variance for which lim^o, L(k) = 
H(2H - 1). Figure [2] shows the sample autocovariance of the discretized process and that of the 
continuous process for f = 0.1 and H = 0.7 and H = 0.85 and different sample size, namely 
N = 2 W = 1,024 and N = 2 U = 16, 384. Before discussing this figure, we remind that the sample 
autocovariance (and the autocorrelation) is a biased estimator for long memory time series. Hosk- 
ing (1996) showed that for a generic long memory process with an autocovariance asymptotically 
decaying as y{k) ~ Ak~ a with < a < 1 the sample covariance y(k) has an asymptotic bias 

— 2AN~ a 

E[y(k)] - y(k) ~ — as N -» oo (16) 

(1 -a){2-a) 

where N is the length of the sample time series. Since Hosking's theorem only requires that the 
process is long memory, we can apply it also to the discretized time series. Figure [2] shows a very 
good agreement between simulations and analytical results (with bias correction) of the asymptotic 
autocorrelation function. 

3.1.2. Sign process 

Similarly to above, we can compute the autocovariance and the autocorrelation function of the 
sign of a Gaussian weakly stationary process. 

Proposition 3. The autocovariance function y s (k) and the autocorrelation function p s (k) of the 
sign of a discrete time Gaussian weakly stationary process with autocorrelation p(k) is 

2 
y s (k) = p s {k) = - arcsinp(fc) (17) 

n 

Therefore also the sign transformation preserves the long memory property and the Hurst ex- 
ponent. 

If the autocorrelation p is small (for example if the lag k is large) we have 

y s (k) = p s {k) = - (p(k) + ^ ) + 0{p 5 (k)) (18) 



n\ 6 

This expression has been obtained several times, as, for example, in the context of binary time 
series (see Keenan (1982)). 

Figure [3] shows the autocorrelation of the sign process of a fGn for different values of H to- 
gether with the first approximation and the exact result. Also in this case the agreement between 
simulations and theory is very good. 

3.2. Spectral density 

From the Wiener- Khinchin theorem we know that the spectral density of a weakly stationary 
random process is the Fourier transform of the autocovariance function. We use this fact to calcu- 
late the small frequency behavior of the spectral density of the discretization of a Gaussian long 
memory process. 
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Figure 2: Sample autocovariance of a numerical simulation of a fGn and its discretization with a scaling parameter 
£ = 0.1. The time series has length N = 2 10 (top) and N = 2 14 (bottom). The Hurst exponent of the fGn is H — 0.7 
(left) and H - 0.85 (right). The figure also shows the asymptotic theoretical autocovariance and the autocovariance 



corrected for the finite sample bias of ( 16 1 both for the fGn and for its discretization. 



3.2.1. Discretized process 

In the case of a discretization with scaling parameter £ we prove the following proposition: 

Proposition 4. Let WOLeN be a discrete time Gaussian weakly stationary process with autoco- 
variance function given by Definition [7] The spectral density <f>d(a>) of the discretized process 
[Xd(i)}m satisfies 



lim 4> d ((o)/ 



w^0+ 



V^ 



c^\(o\ a - l L{idr l ) 



= 1 



(19) 



where #„(w, q) is the elliptic theta function, and c^ = n T(l - a) sin(na/2). 



Moreover, similarly to the case of the autocovariance, second-order corrections to the spectral 
density are hidden inside the slowly varying function L(k) and the terms we neglected. To compute 
these corrections explicitly in terms of powers of \co\ we need to specify a functional form for the 
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Figure 3: Sample autocovariance of a numerical simulation of a fGn and its sign. The time series has length N - 2 10 
(top) and N = 2 14 (bottom). The Hurst exponent of the fGn is H - 0.7 (left) and H = 0.85 (right). The figure also 



shows the exact theoretical autocovariance and the autocovariance corrected for the finite sample bias of ( 16 1 both for 
the fGn and for its sign. 



slowly varying function. We therefore consider long memory processes whose slowly varying 
function in the autocovariance is well behaved according to Definition [2] We prove the following 

Proposition 5. Let {x(z)}jeN be a discrete time Gaussian weakly stationary process with autoco- 
variance function given by Definition^and L e £,(I,bi,fii). We also assume that in Definition^ 
either I < co, or, ifl = oo, then sup{/3} > 1 - a. Then, the spectral density 4>d(oS) of the discretized 
process {xd(i)}i<m satisfies as co — > + 



<f>d(oj) 



^ 2 (0,<T 1/2 0' 



V^r 



Cpb G \co 



a-\ 



r (i + o(M min(2 ^ i ' 1 - Q-) )) 

(l + O (max (M 1 -* In M" 1 , M 2a ))) 
(l+tflmaxlM^lnM- 1 ,!^ 1 ))) 

(l + olM^lnM- 1 )) 



if a+/3i # l,3ar # 1 

if a +fii = 1,3a £ 1 

if a +fii # 1,3a = 1 

if a +/3\ = 1,3a = 1 



(20) 



where # a (M, q) is the elliptic theta function, and c^ 
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n ! r(l - or) sin(7TQ , /2). 



Note that the above expressions do not give exactly the prefactor of the second order correction, 
but only its order of magnitude. The sign of the prefactor is important to describe the deviation of 
the spectral density from a pure power law and it is also important to estimate the sign of the bias 



in the estimation of the Hurst exponent with the periodogram (see Section 4.1.3). By using the 
results of the asymptotic theory of Fourier transform (see Zygmund (1959)) we are able to compute 
exactly the prefactor of the second-order correction for H > 5/6 (i.e. a < 1/3). Unfortunately, for 
H < 5/6 the theory gives us only the order of magnitude. 

As we have mentioned above, an important case is given by long memory processes whose 
slowly varying function is analytical at infinity. This class includes the fGn and the fARIMA 
process. In these cases the conditions of Proposition [5] hold and we prove the following corollary: 

Corollary 2.IfLe Rq and L is analytic at infinity, then 



4>d(to) 






c,pbo \co 



(i-i 



\l+o(\co\ {i ~ a) )) for 1/3 <a< 1 

(l+tflM^lnM- 1 )) for a = 1/3 
(l +o{\co\ 2a )) for 0<a< 1/3 



(21) 



where $ a {u, q) is the elliptic theta function, and c^ = n : r(l - or) sin(o7r/2). 



3.2.2. Sign process 

Similarly to above, we can compute the spectral density of the sign of a Gaussian weakly 
stationary process. 

Proposition 6. Let {x(0}/eN be a discrete time Gaussian weakly stationary process with autoco- 
variance function given by Definition^ The spectral density (p s (oo) of the sign process {sign (x(i))}<eN 
satisfies 

' 2 



lim (p s (io)l 

<A>— >0 + 



— cvM^L^- 1 ) 
nD 



1 



(22) 



where Ca, = n T(l - a) sin(;ra72). 



Proposition 7. Let {x(/)}, e N be a discrete time Gaussian weakly stationary process with autoco- 
variance function given by Definition^and L e £,(I,bi,j3j). We also assume that in Definition^ 
either I < oo, or, if I = oo, then sup{/3} > 1 -a. Then, the spectral density (p s {oS) of the sign process 
[sign (x(/))),eN satisfies the same equations as the spectral density (f>d(^>) of the discretized process 

' ' replaced by -|j. 



{Xd(i)}ieN i n Proposition 
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Corollary 3. If L e %$ and L is analytic at infinity, then the spectral density s (u)) satisfies the 



same equations as the spectral density (f>d(oj) in Corollary 



with 



ff 2 (0,e~ l/2 



replaced by ' 



nD' 
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3.2.3. The spectral density of the discretization offGn andfARIMA processes 

The fGn and the fARIMA process are often used in modeling Gaussian long memory pro- 
cesses. Starting from the explicit functional form of the spectral density of either of these two 
processes it is direct to show (see Beran (1994)) that the function L(k) is analytical. Moreover, by 
factoring out the 0(\a>\ a ~ l ) term, the (relative) second order term of the spectral density is 0(co 2 ), 
which is a very small correction to the leading term of order 0(1) even for relatively large fre- 
quencies. Hence, roughly speaking, on a log-log plot the spectral density of both the fGn and the 
fARIMA process is close to a straight line even for relatively large frequencies. 

On the other hand, the two corollaries above state that for the discretization (or for the sign) of 
a Gaussian long memory process with analytical L(k) the relative second order correction to the 
spectral density around + is always larger than O ( M 2/3+£ ) for any s > which is a relatively large 
correction even for small frequencies. On a log-log plot this leads to a significant deviation from 
the straight-line behavior (leading term) even for relatively small frequencies. By using the theory 
of asymptotic Fourier transforms (Zygmund (1959)) we can compute explicitly the prefactor of 
the second order correction of the spectral density of a discretization of a fGn and of a fARIMA 
process when H > 5/6 (i.e. a < 1/3). In both cases we find that the prefactor is positive. As 
mentioned above, we are not able to compute it analytically when H < 5/6. As we will show 
below, these observations have important consequences for the Hurst exponent estimators based 
on the periodogram, which is an estimator of the spectral density. 

In Figures [4] and [5] we show examples of the sample periodogram of the discretized fGn 
(£ = 0.1) and of its sign, respectively, for different time series length and Hurst exponent. As 
expected, while the periodogram of the fGn is very straight in a log-log scale, the periodogram 
of the discretized process, as well as the periodogram of the sign process, changes its slope for 
increasing frequencies. For both the discretized process and the sign process the absolute slope 
of the periodogram decreases for increasing frequencies, both when H > 5/6 and when H < 5/6. 



As we will show in Section 4. 1 , analytical calculations and numerical simulations indicate that the 



prefactor of the second order correction is always positive. 

4. Estimation of the Hurst exponent 

In this section we investigate the estimation of the Hurst exponent from finite time series of 
the discretized process. We numerically generate time series of discretized long memory pro- 
cesses and we compute the Hurst exponent by using the log-periodogram regression and the DFA. 
Despite the fact that, as we have demonstrated in the previous sections, the Hurst exponent of 
the discretized process is the same as the one of the original process, we will show here that both 
methods give estimates of the Hurst exponent which are systematically and significantly negatively 
biased. 
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Figure 4: Sample periodogram of a numerical simulation of a fGn and its discretization with a scaling parameter 
£ = 0.1. The time series has length N = 2 10 (top) and N = 2 14 (bottom). The Hurst exponent of the fGn is H - 0.7 
(left) and H - 0.85 (right). The figure also shows the leading term of the expansion of the spectral density for small 

Cl>. 



4.1. Periodogram 

4.1.1. Definition and notation 

One of the most common estimators of the spectral density is the periodogram. Given a finite 
time series x(i) with i = 1 , . . . , N, the periodogram is 



I(coj) 



1 

2dV 



YjX(ky 



iaijk 



k=l 



7=1,. ...M 



(23) 



where coj = 2nj/N and M is a positive integer equal to the integer part of (N - I) 12. 

The spectral density of weakly stationary Gaussian long-memory processes is often rewritten 
as 

0(w) = |1 - e-^r^fico) (24) 
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Figure 5: Sample periodogram of a numerical simulation of a fGn and its sign. The time series has length N = 2 10 
(top) and N = 2 14 (bottom). The Hurst exponent of the fGn is H - 0.7 (left) and H = 0.85 (right). The figure also 
shows the leading term of the expansion of the spectral density for small at. 



where (f>*{-) is an even, positive, continuous function on [-n, n], bounded above and bounded away 
from zero. The function (p* determines the high-frequency properties of the series, i.e. the short 
term correlation structure. 

Geweke and Porter-Hudak (1983) (GPH) proposed a semiparametric estimator of d (or H) 
given by the least squares estimate of the slope parameter of the ordinary linear regression of the 
first m values of log Iicjj) on the explanatory variable Xj = log 1 1 - e~ ,ajj \ = log \2 sin(o; 7 /2)|. The 
regression model is 



log I(coj) = log <f (0) - 2dXj + log(f((Oj)/f(0)) + Sj 1 < j < m < M 



(25) 



where e 7 = log \I{cOj)l(p{ojj)\. The rationale behind this estimator is based on two assumptions: 
(i) Sj are approximately uncorrected and homoskedastic and (ii) the term log(0*(o» i )/0*(O)) is 
negligible for small coj < co m . From the second assumption it is clear that the choice of m is 
an important practical problem in the implementation of the GPH estimator. This choice entails a 
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bias-variance tradeoff. Using only the m lowest Fourier frequencies in the regression helps to avoid 
biases due to the possible non-constancy of log(0* \co j) / '<p* (0)1, but increases the variance of the 
estimate. Geweke and Porter-Hudak suggested the choice of the upper bound m GPH = N 1/2 . They 
found that this choice gave good results in numerical simulations, but they gave no theoretical 
basis for this choice. 

Agiakloglou, Newbold and Wohar (1993) performed Monte Carlo simulations and showed that 
the GPH estimator can be badly biased on moderate- sized samples. They argued that the bias is 
caused by dropping the term log (<p*(a)j)/<p*(0)) in the regression model proposed by Geweke and 
Porter-Hudak. 

Moreover, subsequent studies showed that Sj are asymptotically neither independent nor iden- 
tically distributed when N — » oo and j stays fixed (see Kiinsch (1986), Hurvich and Beltrao 
(1993), Robinson (1995)). Independently and around the same time, Robinson (1995) and Hurvich 
and Beltrao (1993) proved that the periodogram I(a)j), when evaluated at the Fourier frequencies 
(which depend on N), suffers from an asymptotic relative bias as an estimator of (p{coj). Numeri- 
cal evaluations in Hurvich and Beltro (1993) showed that the asymptotic relative bias is typically 
positive, monotonically increasing in \d\, and monotonically decreasing in jq Thus the asymptotic 
relative bias is positive and more significant for the first Fourier frequencies, which are the most 
important frequencies for estimating d. As we will discuss below, the extent to which this bias 
affects the estimation of the memory parameter depends on the growth rate of m with respect to 
N. As far as we know, most of the literature only considered either Gaussian processes (Hurvich 
and Beltrao (1993), Robinson (1995), Hurvich et al. (1998)) or linear processes (Velasco (2000)), 
establishing conditions on m such that the log-periodogram estimate of J is a consistent estimator. 

Under Gaussianity assumption Hurvich, Deo, and Brodsky (HDB) (1998) proved consistency 
and asymptotic normality of the estimator as long as m — » oo and N — » oo with (m log m)/N — » 0. 
Under these and additional regularity conditions, they find the asymptotic optimal choice for m 
in order to minimize the mean square error of the estimator. The additional regularity conditions 
are that (p* is bounded above and bounded away from zero, it is three times differentiable with 
first derivative </>*'(0) = 0, and second and third derivatives bounded in a neighborhood of zero. If 
0*"(O) # 0, they proposed the following optimal m 

Hurvich et al. (1998) considered fARIMA processes, which satisfy these regularity conditions. 
However, for example, the optimal m HDB is not defined for a f ARIMA(0, d, 0) process, because for 
this process 0*"(O) = 0. Moreover, these regularity conditions are quite restrictive. The fGn, for 
example, does not satisfy therry 



3 The only exception is a small closed interval around d — where asymptotic relative bias is slightly smaller than 
unity, and for fixed d it is decreasing in j. In any case, negative asymptotic relative bias, when it does occur, is of such 
a small magnitude that it would be of negligible consequence in practice. 

4 In fact, it is easy to show that for the fGn <p*{oS) - 1 + c\ u> 2 + C20> 2(1+ * + O (w 3+M ), where c\ < and c 2 > 0. The 
third derivative of this function is not uniformly bounded in a neighborhood of zero, because of the term <y 2(1+d) . 
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We are concerned here with the periodogram estimation of the Hurst exponent of the dis- 
cretized process which is neither Gaussian nor linear. Therefore our processes do not satisfy the 
assumptions of Hurvich et al. (1998) because they are non-Gaussian and because, as we show 
below, (p* of the discretized process does not satisfy the HDB regularity conditions. For these 
reasons, in the following numerical simulations we take different values of the threshold, namely 
m = N XI2 ,N 3I5 ,N AI5 . In other words, we consider the GPH and the HDB threshold (even if there 
is no theoretical justification) and an intermediate threshold. 

4.1.2. Numerical simulations 

We numerically generated the Gaussian process as a fGn and we considered two values of the 
Hurst exponent, namely H = 0.7 and 0.85. This choice of Hurst exponents is motivated by the 
need to have values below and above the critical value H = 5/6. In order to test the dependence 
on the time series length we considered series of length TV = 2 10 and N = 2 14 . We then applied 
four discretization procedures corresponding to discretization parameter f = 0.1, 0.25, 0.5 and the 
sign of the process. We performed L = 10 3 simulations in order to obtain the statistical properties 
of the estimators. 

The results of the estimation over the L = 10 3 simulations are summarized in Tables Q] and 
|2j The tables show the mean, standard error, and three quantiles (2.5%, 50%, and 97.5%) of the 



estimator H of the Hurst exponent obtained by using the Geweke-Porter Hudak regression (25) 
with the three thresholds, namely m = N 112 , N 3/5 and N 4/5 . 

For the continuous process, the periodogram has a positive asymptotic relative bias as estimator 
of the spectral density and this bias is monotonically decreasing in j (Kunsch (1986), Hurvich and 
Beltrao (1993), Robinson (1995)). In our results this is evident from the slight overestimation 
of H for the continuous process with both N = 2 10 and N = 2 14 , especially for m = /V 4/5 . For 
the discretized process we observe that the negative bias due to the discretization increases with 
m, as expected by Proposition [8] (see below). The GPH estimator (m = N l/2 ) is less biased by 
discretization than the estimator with threshold m = N 4/5 . However, as pointed out in Hurvich et al. 
(1998), the GPH estimate has a large dispersion and is very noisy. These properties are somewhat 
expected from the bias variance tradeoff. Looking more carefully at the estimators with m = N 3 ^ 5 
and m = N 4/5 we observe that for all values of N and H, H of the discretized series is significantly 
negatively biased. Specifically, when H = 0.7 we observe a severe underestimation of the Hurst 
exponent for the sign and for £ = 0.1. When H = 0.85 we observe a severe underestimation of 
the Hurst exponent for the sign and for £ = 0.1,0.25. This problem is still evident for long time 
series. Obviously, in this case the standard error declines, but the finite sample negative bias is not 
significantly reduced. For example, for m = N 4/5 and N = 2 14 the 97.5% quantile is smaller than 
(or approximately equal to) the true H for the sign and for £ = 0. 1, 0.25, both for H = 0.7 and for 
H = 0.85. We also note that the effect of the bias due to the discretization is increasing with the 
long memory parameter, i.e. the larger is H, the larger is the discretization bias. Finally, note that 
the bias is a finite size effect. 

We ran the same simulations for a fARIMA(0, d, 0), and we got similar results. 
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m = N 1/2 

2.5% 

Mean(fl) 50% 

(Std. Err.(//)) 97.5% 


m = A^ i/5 

2.5% 

Mean(H) 50% 

(Std. Err.(H)) 97.5% 


m = JV 4/3 

2.5% 

Mean(fl) 50% 

(Std. Err.(H)) 97.5% 


N - 2 1 " 


Sign 


0.6764 
(0.0043) 


0.3916 
0.6764 
0.9333 


0.6687 
(0.0027) 


0.4949 
0.6696 
0.8359 


0.6596 
(0.0013) 


0.5784 
0.6608 
0.7357 


C = 0.1 


0.6491 
(0.0041) 


0.3703 
0.6523 
0.8884 


0.6398 
(0.0028) 


0.4448 
0.6428 
0.8036 


0.6231 
(0.0014) 


0.5417 
0.6228 
0.7044 






0.4231 




0.4817 




0.5933 



( = 0.25 
C, =0.5 



Continuous 



Sign 
C = 0.1 



Continuous 



0.6880 
(0.0042) 

0.6904 
(0.0043) 



0.6968 
0.9357 
0.4061 
0.6997 
0.9553 



0.7035 
(0.0043) 



0.4065 
0.7066 
0.9605 



0.6799 
(0.0029) 

0.6914 
(0.0028) 



0.6869 
0.8431 
0.4993 
0.6939 
0.8497 



0.7019 
(0.0028) 



0.5179 
0.7035 
0.8619 



0.6731 
(0.0013) 

0.6939 
(0.0013) 



0.6731 
0.7543 
0.6125 
0.6953 
0.7685 



0.7142 
(0.0013) 



0.6291 
0.7158 
0.7943 



N = 2 



vr 



0.6823 
(0.0020) 

0.6685 
(0.0019) 



0.5609 
0.6821 
0.7941 
0.5483 
0.6703 
0.7819 



0.6765 
(0.0011) 

0.6577 
(0.0011) 



0.6051 
0.6777 
0.7426 
0.5876 
0.6583 
0.7287 



0.6614 
(0.0004) 

0.6322 
(0.0004) 



0.6355 
0.6612 
0.6856 
0.6045 
0.6322 
0.6609 



<T = 0.25 


0.6914 
(0.0019) 


0.5759 
0.6898 
0.8053 


0.6872 
(0.0011) 


0.6189 
0.6883 
0.7535 


0.6763 
(0.0004) 


0.6475 
0.6765 
0.7025 


C, =0.5 


0.6949 
(0.0019) 


0.5705 
0.6961 
0.8084 


0.6937 
(0.0011) 


0.6226 
0.6943 
0.7599 


0.6899 
(0.0004) 


0.6614 
0.6904 
0.7167 






0.5817 




0.6279 




0.6763 



0.7006 
(0.0019) 



0.7007 
0.8133 



0.7011 
(0.0011) 



0.7012 
0.7682 



0.7055 
(0.0004) 



0.7063 
0.7312 



Table 1 : Estimation of the Hurst exponent of a fGN with H — 0.7 and its discretization with the sign function and with 
£ — 0.1, 0.25, and 0.5. The time series has length 2 10 (top) and 2 14 (bottom). The estimation of the Hurst exponent is 
performed with the periodogram by using thresholds m - N 1 ^ 2 , m = A^ 5 , and m = N 4 ^ 5 (see text). The table reports 
the mean value of the estimator H over 10 3 numerical simulations, together with the standard error and the 2.5%, 
50%, and 97.5% percentile. 



4.1.3. The bias of the log-periodogram estimator of H for the discretized process 

We give now some theoretical arguments that helps understanding the possible origin of esti- 
mation biases and their sign when the regression of the periodogram is used to estimate the Hurst 
exponent of the discretized process. 

We follow here the notation of Hurvich et al. (1998). Let d be the log-periodogram regression 
estimate of the true long memory parameter d. We know that 



d 



1 



1 



d = > a,- log 01 / a i £ i 



7=1 



25, 



18 





m = N 1/2 

2.5% 

Mean(fl) 50% 

(Std. Err.(//)) 97.5% 


m = A^ i/5 

2.5% 

Mean(#) 50% 

(Std. Err.(H)) 97.5% 


m = JV 4/3 

2.5% 

Mean(tf) 50% 

(Std. Err.(//)) 97.5% 


/V-2 i!l 


Sign 


0.8175 

(0.0041) 


0.5596 
0.8201 
1.0652 


0.8065 
(0.0029) 


0.6272 
0.8094 
0.9785 


0.7794 
(0.0014) 


0.6914 
0.7796 
0.8636 


C = 0.1 


0.7781 
(0.0046) 


0.5068 
0.7811 
1.0757 


0.7603 
(0.0031) 


0.5582 
0.7586 
0.9542 


0.7291 

(0.0017) 


0.6304 
0.7284 
0.8304 


C = 0.25 


0.8317 
(0.0042) 


0.5739 
0.8393 
1.0805 


0.8225 
(0.0027) 


0.6462 
0.8261 
0.9769 


0.8027 
(0.0013) 


0.7236 
0.8031 
0.8838 


C, =0.5 


0.8477 
(0.0042) 


0.5801 
0.8492 
1.0961 


0.8403 
(0.0028) 


0.6660 
0.8440 
1.0038 


0.8336 
(0.0013) 


0.7529 
0.8334 
0.9159 


Continuous 


0.8562 
(0.0042) 


0.5798 
0.8606 
1.1071 


0.8566 
(0.0028) 


0.6751 
0.8585 
1.0199 


0.8738 
(0.0013) 


0.7919 
0.8745 
0.9535 


JV = 2 14 


Sign 


0.8379 
(0.0019) 


0.7142 
0.8388 
0.9471 


0.8244 
(0.0012) 


0.7512 
0.8264 
0.8927 


0.7885 
(0.0004) 


0.7618 
0.7892 
0.8132 


C, =0.1 


0.8186 
(0.0019) 


0.6981 
0.8218 
0.9280 


0.7990 
(0.0012) 


0.7195 
0.8002 
0.8705 


0.7472 
(0.0005) 


0.7152 
0.7466 
0.7807 


C = 0.25 


0.8467 
(0.0019) 


0.7305 
0.8478 
0.9588 


0.8377 
(0.0011) 


0.7651 
0.8382 
0.9062 


0.8095 
(0.0004) 


0.7825 
0.8096 
0.8365 


C, =0.5 


0.8508 
(0.0019) 


0.7298 
0.8518 
0.9623 


0.8450 
(0.0012) 


0.7724 
0.8461 
0.9129 


0.8313 
(0.0004) 


0.8053 
0.8310 
0.8584 


Continuous 


0.8548 
(0.0019) 


0.7345 
0.8552 
0.9608 


0.8525 
(0.0012) 


0.7795 
0.8537 
0.9244 


0.8569 
(0.0004) 


0.8305 
0.8570 
0.8828 



Table 2: Estimation of the Hurst exponent of a fGN with H = 0.85 and its discretization with the sign function and 
with f = 0.1, 0.25, and 0.5. The time series has length 2 10 (top) and 2 14 (bottom). The estimation of the Hurst 
exponent is performed with the periodogram by using thresholds m = N 1 ^ 2 , m = N 3 ? 5 , and m - N 4 ^ 5 (see text). The 
table reports the mean value of the estimator H over 10 3 numerical simulations, together with the standard error and 
the 2.5%, 50%, and 97.5% percentile. 



where <f>* = (p*{co } ), a } 



Xj-^XtandS, 



Y, ai. Consequently, the bias of d is given by 
it=i 



-. m -. m 

Md -d] = -^Y.aj log cf>* - — - V ajE[ej] 

xx j=\ xx j=l 



(27) 



This expression shows that the bias is the sum of a contribution coming from the high frequency 
components of the spectral density and a contribution coming from the error terms Sj. 

Hurvich et al. (1998) considered a generic Gaussian long memory process satisfying the reg- 
ularity conditions that <p*'(Q) = 0, \f"{co)\ < B 2 < oo, and \<p*'"(co)\ < B 3 < oo for all a; in a 
neighborhood of zero. In the limit m — » oo, A^ —* oo, with m/N —* and (mlogm)/A/ —> they 
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proved that 

1 A , -2n 2 f"(0)m 2 (m 2 \ 

-^L^^^—^^+o^-y (28) 

This expression gives the bias due to high frequency components of the spectral density. It is 
possible to show that the above result holds also for a generic non-Gaussian process satisfying the 
Hurvich et al. regularity conditions. 

However, this result does not hold for the discretized process. In fact from Proposition [5] we 
know that that, for example, when a\ +/?i ± 1 and 3a # 1 (case (i) of Proposition [5]) we have 

f d ((Oj) = C (l + O (laj.^afiul-^ (29) 

Because mm(2a,fi\, 1 - a) < 1, the first derivative of (p*, does not even exist in co = 0, and both the 
left- and the right-derivative are unbounded in a neighborhood of zero. Obviously, this is true also 
for all the higher-order derivatives. 

Here we generalize the above result of Hurvich et al. (1998) to the discretized process by 
proving the following proposition: 

Proposition 8. Under the conditions of Proposition^] case(i), let a = mm(2a,/3\, I - a) and C 
the pre-f actor in front of the second-order correction. Let e > s. t. the third-order correction 
to spectral density of the discretized process is O (co- +E ). Assume that m — » oo, N — » oo, with 
m/N -* and ( m min( ^ e) logm)/A^ min( ^ £) -^ 0. Then, 

1 f , , Ca(2nf-m^ [j&X 

~ 2^ x L a ^^^ = -20^m +o [m) (30) 

Similar considerations and a straightforward extension of this proposition can be obtained for 
the other cases of Proposition [5} The proof of this and of the other propositions and theorems in 
this section are in | Appendix C| 

Let us restrict now our attention to the discretization of a fGn or of a f ARIMA(0, d, 0) pro- 
cess. This proposition shows that the bias due to log(0*(o> ; )/0*(O)) is much larger for the dis- 
cretized process than for the original process. In fact the rate of decay of the bias as m, Af — » oo is 
much smaller for the discretized process ((m/N)-) than for the original process ((m/N) 2 ). More- 
over, the discretized process needs a stricter trimming condition ((m min( - £) logm)/7V mm( - £) — > 0) 
than the original process ((m\ogm)/N — > 0). We also mentioned above that for a fGn or for a 
f ARIMA(0, d, 0) process the prefactor C of the second order correction of the spectral density is 
positive when H > 5/6. This means that in this regime the bias due to the high frequency com- 
ponents is negative. For H < 5/6 we are not able to compute the sign of C. However, from our 
Monte Carlo simulations we observe that the log spectral density is strongly upward sloping at 
high frequencies for both H > 5/6 and H < 5/6. This suggests that the prefactor C is positive for 
all H. From these empirical observations and from Proposition [8} we conjecture that the bias due 
to the high frequency components is negative for all H. 

The total bias on d has also a contribution due to the errors Sj. Under Gaussianity Robinson 
(1995) and Hurvich et al. (1998) proved that E[sj] = 0(log j/j). Velasco (2000) in the proof of his 
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Theorem 1 obtained similar results under the assumption that the process is lineaifj However, the 
discretized process is neither Gaussian nor linear and thus we cannot use these results. Moreover, 



we do not have an estimation of the second term in the right hand side of (27 1. From Jensen's 
inequality we have E[sj] < logE \l(ojj)/<p(a>j) , and from Hurvich and Beltrao (1993) and Robin- 
son (1995) we know that asymptotically E 7(a» ; -)/0(a» y -) is strictly greater than 1. Therefore, the 
bias due to Sj may be either positive or negative. This will depend on the distribution function of 
Sj, which is nontrivial for the discretized process. For these reasons we infer the behavior of the 
second term from the numerical simulations. 

In the previous section we have seen that simulations show that the periodogram estimator of 
d is negatively biased. Note that this is different from the Gaussian case (see for example Agiak- 
loglou et al. (1993)) where the periodogram is positively biased. From this empirical observation 
we can make two possible conjectures. Either the bias due to Sj is also negative and thus the 
total bias is also negative, or the bias due to Sj is positive but in absolute value smaller than the 
(negative) bias of the high frequency components, and thus the total bias is negative as observed 
in numerical simulations. 

More in depth analyses are needed to clarify this point. Our proposition [8] gives a bound of the 
high frequency component of the bias and shows that it slowly decays to zero. This means that it 
likely has an important role in the total bias of the periodogram. 

4.2. Detrended Fluctuation Analysis 
4.2.1. Definition and notation 

We now consider the Detrended Fluctuation Analysis (DFA) (see Peng et al. (1994)), a method 
to investigate the properties of a long memory process and to estimate the Hurst exponent. This 
method was introduced more than fifteen years ago to investigate physiological data, in particular 
the heartbeat signal. Since its introduction it has been applied to a large variety of systems, includ- 
ing physical, biological, economic, and technological data. The idea is to consider the integrated 
process and detrend it locally. The scaling of the fluctuations of the residuals as a function of the 
box size in which the regression is performed gives the estimate of the Hurst exponent. More pre- 
cisely, let [x(l), . . . x(N)} be a finite time series and denote the discrete integration of this sample 
as 

k 

y(k) = Y J *(i) (3D 

The integrated time series is divided into [N/m] boxes of equal length m, where [z] is the 
integer part of z. For j e {1, . . . , [N/m]}, let us define the vector: 

y (j) = (y(\+m(j-\)),...,y(mj)y (32) 

In each box, a least squares line is fit to the data (representing the trend in that box). The y 
coordinate of the straight line segments is denoted by y m (k). Next, one detrends the integrated time 

5 Note, however, that Velasco considered the pooled log-periodogram. 
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series, y(k), by subtracting the local trend, y m (k), in each box. The root-mean-square fluctuation of 
this integrated and detrended time series is calculated by 



F(m) 



\ m ■ [N/m] 



. m-[N/m] 

T77-T y iy(k)-y m (k)] 2 . (33) 



k=\ 



This computation is repeated over all time scales (box sizes) to characterize the relationship be- 
tween F(m) and the box size m. Typically F(m) increases with box size m. For a long memory 
process the Hurst exponent is given by the relation F(m) oc m H . Therefore H is estimated by per- 
forming a log-regression of F(m) versus m. The proposers of this method claim that DFA is able 
to perform well also in the presence of non-stationarities, such as trends, (see Peng et al. (1994)), 
even if some recent results dispute this claim (see Bardet and Kammoun (2008)). 
The partial DFA function computed in the j'-th window of size m is 

F)(m) = - V (y(i)-y m (i)) 2 (34) 

i=m(j-l)+l 

for j 6 {1, . . . , [N/m]}. Then, we can write 

[N/m] 

F2(m) = FXTT-f E F ' (m) (35) 

[N/m\ £—^ J 

Recently, Bardet and Kammoun (2008) (Lemma 2.2) showed that for a stationary process 
the series {Fj(m)}i<j< [N / m] is a stationary process for all m. This means that in order to study the 
asymptotic statistical properties of the DFA we can focus on F^(m) only. They then used this result 
to provide an asymptotic expression (Theorem 4.2) for E LFj(m) for a general class of Gaussian 
long memory processes. Here we extend this result to non-Gaussian long memory processes (see 
Theorem [T] below). In doing that we show that the order of the correction of the asymptotic 
expansion given in Bardet and Kammoun (2008) for generic Gaussian processes (Theorem 4.2) is 
incorrect and we provide the correct order. For the special case of fGN and f ARIMA the result 
provided in Bardet and Kammoun (2008) is correct because of the cancellation of the prefactor of 
the correction term observed for a generic long memory process. 

4.2.2. A theorem on the detrended fluctuation analysis of a general long memory process 

Here we generalize Theorem 4.2 of Bardet and Kammoun (2008) to a general class of non- 
Gaussian long memory processes. The discretized process belongs to this class. 
We prove the following: 

Theorem 1. Let {x(/)} i(E N be a weakly stationary long memory process, with zero mean, finite 
variance, and the following autocovariance function 

y(k) = Ak 2H ~ 2 [\ + (r^)) for k -> oo , (36) 
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with A > 0, 1/2 < H < 1, and (3 > 0. Then, 

L ! J |(l + tffrn 1 - 2 " In m)) if fi = 2H - 1 

where D' = — - — and f(H) = — 

First of all, note that the class of processes for which one can apply this theorem is more 
general of those of Definition [2| but clearly less general than those of Definition [T] Second, our 
theorem gives a different order of the correction compared to the one given in Theorem 4.2 of 
Bardet and Kammoun (2008). In fact, Bardet and Kammoun give 0(m~ mm(1 ^ ) ), while our theorem 
gives 0(m~ mm{2H ~ l 'P ) ). In many cases of interest (for example when/3 > 1) the leading error term 
is 0(l/m 2ff-1 ) rather than 0(l/m) as stated in Bardet and Kammoun (2008). 

For fGn or f ARIMA(0, d, 0) we observe a specific behavior. In fact, in these cases the error 
term is 0(m~ l ), instead of 0(m~ mm( - 2H ~ 1 ^), because the prefactor of the terms of order 0(m l ~ 2H ) 
cancels out exactly, and/3 = 2. This depends on the specific functional form of the autocovariance 
of these processes. Therefore, as stated in Bardet and Kammoun (2008), Property 3.1. we have: 

Corollary 4. For a fractional Gaussian noise (fGn) it is 

E [F 2 (m)] = Df(H)m 2H (\ + 0(m- [ )) (38) 

The general Theorem [T] gives the asymptotic properties of the fluctuation function for a time 
series obtained by discretizing or taking the sign of a Gaussian process. The root mean square 
fluctuations for discretized processes are given by the following two corollaries 

Corollary 5. For the discretized process of a Gaussian time series with the properties of Theorem 
\l\it is 



E[F 2 (m)] 



V2^r 



D'f(H)m 



2H 



(l + 0(m l ~ 2H )) 


if 1/2 <H <5/6 


(l +0(m~ 2/3 lnm)) 


if H = 5/6 (39) 


(l+0(m 4H - 4 )) 


if 5/6 < H < 1 



where fidu, q) is the elliptic theta function, and D' and f(H) are defined as in Theorem^ 
Corollary 6. For the sign of a Gaussian time series with the properties of Theorem\l\E,\F 2 (m)\ 



satisfies the same equations as in Corollary 



with I 'j — J replaced by Jj. 



4.2.3. Numerical simulations 

We then considered the performance of the DFA as an estimator of the Hurst exponent on 
finite time series. We considered the same parameters as in the analysis of the performance of the 
log-periodogram regression. Hence we discretized a fGn with H = 0.7 and 0.85 and we generated 
L = 10 3 time series of length TV = 2 10 and 2 14 . We then applied four discretization procedures 
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corresponding to discretization parameter f = 0.1, 0.25, 0.5 and the we considered the sign of the 
process. Following a standard practice, we consider values of m ranging from m = 4 to m = N/4. 

Figures [6] and [7] show an example of the root mean square fluctuation F(m) as a function of 
the block size m. As in the case of the periodogram, a different behavior is observed for the 
Gaussian process and for its discretization. For the former F(m) is well described by a power 
law with exponent H over the whole range of investigated block sizes m. On the contrary, for the 
discretized (or sign) process, F(m) changes significantly slope in the log-log scale, i.e. it has a 
varying local power law behavior. Only for large box sizes the function F(m) converges to the 
expected asymptotic behavior from above. This again suggest a significant negative bias in the 
estimation of H on finite samples, bias that we quantitatively observe in estimation below. In fact, 
in order to estimate the Hurst exponent through DFA one needs to perform a best fit of F with a 
power law function. The ambiguity is however the interval of values of m where one performs the 
fit. To the best of our knowledge there is no rule for selecting optimally such an interval. One 
expects to obtain a less biased, but noisier, estimate of H by performing the fit in a small region 
corresponding to large values of the block size m. To investigate this point we estimate the Hurst 
exponent by performing the fit over a fraction q of the largest values of log 10 [m]. We consider 
three values of q, namely q = 1 (the whole interval), q = 0.5 (the largest half), and q = 0.25 (the 
largest quartile). 

The results of our analysis are summarized in Table [3] and [4] When q = 1 we observe an 
underestimation of H for the discretized series. This underestimation is always significant in terms 
of standard errors. Specifically, when N = 2 10 there is a severe underestimation for the sign and 
for C, = 0.1,0.25, both when H = 0.7 and when H = 0.85. For long series (N = 2 14 ) we observe 
a severe underestimation for the sign and for £ = 0.1, both when H = 0.7 and when H = 0.85. 
Note that for TV = 2 14 the 97.5% quantile of H is smaller than the true value of H for the sign and 
for £ = 0.1, both when H = 0.7 and when H = 0.85. By changing q we observe similar results. 
For short samples the DFA estimate with q = 1 performs better than that with q = 0.25. This 
might be due both to the fact that points corresponding to large m are very noisy and to the fact 
that the fluctuation function converges to its asymptotic value from above. Finally, note that for 
the discretized process is not always true that the less biased estimator is obtained for small values 
of q. In fact, in most of the cases we observe quite the contrary. This might be due to the shape 
of the root mean square fluctuation function F that changes slowly slope when the block size m 
varies (see Figures [6] and [7]). 

4.3. Discussion on the estimation of the Hurst exponent of a discretized process 

In conclusion our analytical considerations and numerical simulations show that both the log- 
periodogram regression and the DFA consistently give negatively biased estimates of the Hurst 
exponent H (or of the fractional differencing parameter d) when they are applied to discretized 
processes. This is in contrast with what happens when the log-periodogram regression is used to 
estimate the Hurst exponent of a Gaussian process. In fact, a large body of literature showed that 
for Gaussian processes the log-periodogram regression gives a positively biased estimate of the 
Hurst exponent. Our results show also that the size of the negative bias of H for a discretized 
process can be significant due to the fact that the second order correction of the spectral density or 
of the root mean square fluctuation F of the DFA is large for a discretized process. The negative 
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q = 0.2* 

Mean(//) 
(Std. Erv.(H)) 


2.5% 

50% 

97.5% 


4 = 0.5 

Mean(/7) 
(Std. Ett.(H)) 


2.5% 

50% 

97.5% 


q = \ 

Mean(fl) 
(Std. Err.(#)) 


2.5% 

50% 

97.5% 


N - 2 1 " 


Sign 


0.6462 
(0.0051) 


0.3548 
0.6422 
0.9650 


0.6477 
(0.0029) 


0.4683 
0.6488 
0.8312 


0.6558 
(0.0013) 


0.5765 
0.6563 
0.7304 


C, =0.1 


0.6228 
(0.0049) 


0.3464 
0.6174 
0.9476 


0.6222 
(0.0028) 


0.4591 
0.6198 
0.7946 


0.6259 
(0.0013) 


0.5465 
0.6268 
0.7054 






0.3568 




0.4842 




0.5917 



( = 0.25 
C = 0.5 



Continuous 



0.6612 
(0.0052) 

0.6696 
(0.0052) 



0.6534 
0.9898 
0.3755 
0.6584 
1.0072 



0.6639 
(0.0030) 

0.6724 
(0.0029) 



0.6641 
0.8506 
0.4910 
0.6752 
0.8501 



0.6838 
(0.0034) 



0.6761 
0.9019 



0.6903 
(0.0016) 



0.6898 
0.7882 



0.6712 
(0.0013) 

0.6858 
(0.0013) 



0.7002 
(0.0006) 



0.6725 
0.7478 
0.6062 
0.6883 
0.7589 



Continuous 


0.6797 
(0.0052) 


0.3662 
0.6775 
1.0135 


0.6859 
(0.0029) 


0.5019 
0.6867 
0.8667 


0.7046 
(0.0013) 


0.6254 
0.7055 
0.7807 








N = 


2 14 






Sign 


0.6691 
(0.0034) 


0.4734 
0.6633 
0.8869 


0.6714 
(0.0016) 


0.5717 
0.6709 
0.7703 


0.6655 
( 0.0006) 


0.6279 
0.6660 
0.7003 


C, =0.1 


0.6605 
(0.0034) 


0.4570 
0.6556 
0.8819 


0.6560 
(0.0016) 


0.5547 
0.6543 
0.7547 


0.6421 
(0.0006) 


0.6040 
0.6432 
0.6759 


C = 0.25 


0.6770 
(0.0034) 


0.4726 
0.6726 
0.8992 


0.6801 
(0.0016) 


0.5780 
0.6813 
0.7706 


0.6782 
(0.0006) 


0.6408 
0.6794 
0.7114 


C, =0.5 


0.6796 
(0.0034) 


0.4706 
0.6736 
0.8932 


0.6847 
(0.0016) 


0.5832 
0.6834 
0.7833 


0.6884 
(0.0006) 


0.6511 
0.6890 

0.7242 






0.4761 




0.5892 




0.6633 



0.7012 
0.7345 



Table 3: Estimation of the Hurst exponent of a fGN with H - 0.7 and its discretization with the sign function and with 
f = 0.1, 0.25, and 0.5. The time series has length 2 10 (top) and 2 14 (bottom). The estimation of the Hurst exponent 
is obtained with the Detrended Fluctuation Analysis by performing the least square regression over a q fraction of 
the largest values of log 10 [m] (see text). The table reports the mean value of the estimator H over 10 3 numerical 
simulations, together with the standard error and the 2.5%, 50%, and 97.5% percentile. 
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q = 0.25 

2.5% 

Mean(fl) 50% 
(Std. Err.(H)) 97.5% 


4 = 0.5 

Mean(/7) 
(Std. Err.(#)) 


2.5% 

50% 

97.5% 


q = \ 

Mean(fl) 
(Std. Err.(//)) 


2.5% 

50% 

97.5% 


N - 2 1 " 


Sign 


0.7689 
(0.0059) 


0.4353 
0.7633 
1.1445 


0.7802 
(0.0034) 


0.5704 
0.7789 
0.9898 


0.7735 
(0.0015) 


0.6864 
0.7728 
0.8626 


C, =0.1 


0.7341 
(0.0056) 


0.4106 
0.7236 
1.1188 


0.7357 
(0.0034) 


0.5312 
0.7346 
0.9592 


0.7259 
(0.0017) 


0.6209 

0.7282 
0.8363 


C = 0.25 


0.7909 
(0.0059) 


0.4276 
0.7825 
1.1867 


0.7977 
(0.0034) 


0.5793 
0.7980 
1.0056 


0.7941 
(0.0015) 


0.7031 
0.7947 
0.8801 


f = 0.5 


0.7978 
(0.0059) 


0.4560 
0.7894 
1.1926 


0.8124 
(0.0035) 


0.5891 
0.8154 
1.0265 


0.8195 
(0.0015) 


0.7263 
0.8183 
0.9107 


Continuous 


0.8080 
(0.0060) 


0.4476 
0.8022 
1.2027 


0.8275 
(0.0035) 


0.5959 
0.8323 
1.0393 


0.8508 
(0.0015) 


0.7550 
0.8510 
0.9431 


JV = 2 14 


Sign 


0.8129 
(0.0039) 


0.5686 
0.8148 
1.0528 


0.8193 
(0.0019) 


0.6990 
0.8197 
0.9358 


0.8022 
(0.0007) 


0.7574 
0.8023 
0.8431 


£ = 0.1 


0.7999 
(0.0038) 


0.5576 
0.7989 
1.0358 


0.8009 
(0.0019) 


0.6812 
0.8013 
0.9177 


0.7696 
(0.0007) 


0.7243 
0.7694 
0.8135 


C = 0.25 


0.8199 
(0.0039) 


0.5742 
0.8226 
1.0507 


0.8290 
(0.0019) 


0.7072 
0.8293 
0.9464 


0.8184 
(0.0007) 


0.7741 
0.8186 
0.8601 


f = 0.5 


0.8218 
(0.0039) 


0.5704 
0.8238 
1.0557 


0.8331 
(0.0019) 


0.7103 
0.8321 
0.9512 


0.8331 
(0.0007) 


0.7891 
0.8334 
0.8750 






0.5723 




0.7121 




0.8033 



Continuous 



0.8232 
(0.0039) 



0.8242 
1.0605 



0.8367 
(0.0019) 



0.8366 
0.9554 



0.8496 
(0.0007) 



0.8501 
0.8909 



Table 4: Estimation of the Hurst exponent of a fGN with H - 0.85 and its discretization with the sign function and 
with f = 0.1, 0.25, and 0.5. The time series has length 2 10 (top) and 2 14 (bottom). The estimation of the Hurst 
exponent is obtained with the Detrended Fluctuation Analysis by performing the least square regression over a q 
fraction of the largest values of log 10 [m] (see text). The table reports the mean value of the estimator H over 10 3 
numerical simulations, together with the standard error and the 2.5%, 50%, and 97.5% percentile. 
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Figure 6: Root mean square fluctuation F(m) as a function of the block size m of a numerical simulation of a fGn and 
its discretization with a scaling parameter £ = 0.1. The time series has length N = 2 10 (top) and N = 2 14 (bottom). 
The Hurst exponent of the fGn is H — 0.7 (left) and H - 0.85 (right). The figure also shows the leading term of the 
expansion of the F(m) for large m (dashed lines). 



bias can be partly overcome by taking small values of m (for the log-periodogram regression) or 
small intervals for the fit of the root mean square fluctuations (for the DFA). However, in both cases 
the variance of the estimators become large and thus the estimator is not significantly reliable. 

When we compare the log-periodogram regression and the DFA we find that in terms of bias 
the best estimator of H is the log-periodogram with GPH threshold. The DFA estimator is very 
sensitive to the window chosen for the regression. The plot of the root mean square fluctuation 
might help in determining a suitable window, but the choice seems quite arbitrary and thus this 
method is more suited for an exploratory analysis. 

Despite the fact that we have presented results only for the fGn and fARIMA(0, d, 0) pro- 
cesses, we have also run extensive Monte Carlo simulations for stationary fARIMA(l,<i, 0) and 
fARIMA(0, d,\) processes. The results are similar to those obtained for fGn and fARIMA(0, d, 0), 
with a statistically significant underestimation of the Hurst exponent for the sign process and for 
coarse discretizations, both for the log-periodogram regression and for the DFA method. However, 
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Figure 7: Root mean square fluctuation F(m) as a function of the block size m of a numerical simulation of a fGn and 
its sign. The time series has length N = 2 10 (top) and N = 2 14 (bottom). The Hurst exponent of the fGn is H — 0.7 
(left) and H - 0.85 (right). The figure also shows the leading term of the expansion of the F(m) for large m (dashed 
lines). 



for stationary fARIMA(l,<i, 0) and f ARIMA(0, d, 1) the size of the underestimation depends on 
the high-frequency behavior of the spectral density of the original process. If the coefficient of the 
characteristic polynomial is positive, the size of the underestimation is smaller than for fGN and 
fARIMA(0, d, 0). If the coefficient of the characteristic polynomial is negative, the size of the un- 
derestimation is larger than for fGN and f ARIMA(0, d, 0). In terms of log-periodogram regression 
these results are intuitive. In fact, when we estimate the Hurst exponent of the continuous-valued 
fARIMA(l,<i, 0) or f ARIMA(0, d, 1) process, if the coefficient of the characteristic polynomial 
is positive, the bias due to the high-frequency component of the spectral density is positive. On 
the other hand, if the coefficient of the characteristic polynomial is negative, the bias due to the 
high-frequency component of the spectral density is negative. When we consider the discretized 
(or the sign) process these biases combine with the effect of the discretization, which is negative. 
Therefore, if the coefficient is positive the two effects have opposite signs and the total bias on the 
estimate of the Hurst exponent is smaller. On the other hand, if the coefficient is negative the two 
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effects have the same sign and the total bias on the estimate of the Hurst exponent is larger. For 
the sake of brevity, we do not report these results here, but they are available from the authors by 
request. 

5. Discussion and concluding remarks 

5.1. Relation to the measurement error literature 

As mentioned in the introduction, round-off error can be considered a form of measurement 
error, even if the type of measurement error typically considered in the literature is different, being 
modeled as a white noise uncorrelated with the latent process. On the other hand, the round-off 
error is a deterministic function of the latent process itself, and it is neither uncorrelated with the 
latent process nor a white noise. 

Recently, in the context of short-memory processes, Hansen and Lunde (2010) have studied 
the effect of sampling errors on the dynamic properties of an underlying ARMA(p, q) time series. 
They have proved that the estimates of both the persistence parameter and the autocorrelation 
function are negatively biased by the measurement error. Our paper contributes in this direction 
for the case of long-memory processes subject to round-off errors. In fact, we are able to compute 
exactly the asymptotic scaling factor between the autocorrelation function of the realized process 
and that of the latent process. This scaling factor is a function only of the adimensional parameter 
£, which in turn depends on the grid size (5) and the variance of the underlying series (D). In most 
cases of interest the researcher knows the level of discretization. In principle, if the researcher 
knew also the variance of the underlying process, then she could estimate the autocorrelation 
function of the latent variable exactly for large lags. The same holds true for the spectral density 
at small frequencies and the root mean square fluctuation function for large box sizes. In general, 
however, the variance of the underlying process is not known ex ante, and the researcher needs 
to make inference on it. Even if the problem of the inference on £ (or D) is outside the scope of 
this paper, it is worth mentioning that by considering the fraction q of points of the discretized 
process with value zero it is possible to infer the value of the variance of the underlying process. 
In fact, we showed that q = erf [6/2 V2D] and therefore the knowledge of the grid size 5 and the 
measurement of qo allow to infer D. Finally, note that other methods could be used to estimate 
D, such as, for example, by comparing the autocovariance of the discretized process with two 
different values of £. 

In the context of long memory processes, Hurvich, Moulines, and Soulier (2005) considered a 
process that can be decomposed into the sum of a long memory signal (possibly non- stationary) 
and a white noise, possibly contemporaneously correlated with the signal. They proved that the 
observed series has the same Hurst exponent as the underlying signal, and from numerical simu- 
lations observed that standard estimators of the Hurst exponent, such as the GPH estimator, suffer 
from a negative bias. Hurvich, Moulines, and Soulier (2005) proposed a corrected local Whittle 
estimator, which is unbiased, but has larger asymptotic standard errors. Recently Rossi and San- 
tucci de Magistris (201 1) analyzed the effect of measurement errors on the estimation of the Hurst 
exponent. Specifically, they studied the effect of measurement errors on the estimation of the dy- 
namic properties of the realized variance, as an ex-post estimator of the integrated variance. Rossi 
and Santucci de Magistris (201 1) find that discrete frequency sampling and market microstructure 
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noise induce a finite- sample bias in the fractionally integration semiparametric estimates. Based 
on Hurvich, Moulines, and Soulier (2005) they propose an unbiased local Whittle estimator (with 
larger asymptotic standard errors) that accounts for the presence of the measurement error. How- 
ever, the corrected Whittle estimator proposed by Hurvich, Moulines, and Soulier (2005) and 
Rossi and Santucci de Magistris (201 1) does not apply to the round-off error, because the round- 
off error does not satisfy the standard assumptions on the measurement error. More recently, in 
a working paper, Corsi and Reno (201 1) proposed an indirect inference approach to estimate the 
long memory parameter of a latent integrated volatility series. 

Our paper contributes to this literature by providing the second order term of the spectral 
density and of the root mean square function for the discretized process. A possible extension 
is to identify analytically and numerically the asymptotic region over which the bias on the long 
memory estimates due to these second order corrections becomes negligible. From Proposition [8] 
we know the order of magnitude of the bias on the log-periodogram regression estimate due to the 
high frequency component. A possible extension is to estimate also the bias due to the noise term 



of the regression (see Section 4. 1 ), and then to find a suitable threshold m of Fourier frequencies to 
use in the regression in terms of bias reduction. Similarly, one could use our results on the second 
order correction to the root mean square fluctuation to estimate a suitable threshold q for the DFA 



regression (see Section 4.2 ) in terms of bias reduction. This is outside the scope of the present 



paper and is left as future work. 

5.2. Concluding remarks 

We have presented an extensive analytical and numerical investigation of the properties of a 
continuously valued long memory process subject to round-off error. We have shown that the dis- 
cretized process is also long memory with the same Hurst exponent as the latent process. We have 
explicitly computed the leading term of the asymptotic expansion of the autocovariance function, 
of the spectral density for small frequencies, and of the root mean square fluctuation for large box 
sizes. We have shown that the autocovariance, the spectral density and the root mean square fluctu- 
ation are asymptotically rescaled by a factor smaller than one, and we have computed exactly this 
scaling factor. More important in all three cases we have computed the order of magnitude the sec- 
ond order correction. This term is important to quantify the bias of the Hurst exponent estimators 
based on these quantities. An in depth analysis of Hurst exponent estimators, namely the peri- 
odogram and the Detrended Fluctuation Analysis, shows that both the estimators are significantly 
negatively biased as a consequence of the order of magnitude of the second order corrections. 

These results can be considered the starting point for several future research directions. A 
first interesting question is whether the strong negative bias in the estimation of the Hurst expo- 
nent observed for the discretization transformation is also observed for other types of (non-linear) 
transformation of the Gaussian process. Since one often obtains non-Gaussian long memory pro- 
cesses by transforming a fGN or a f ARIMA process (Dittman and Granger (2002)), the question of 
the bias of the estimator of the Hurst exponent is of great interest. A second extension is to study 
the effect of round-off error on non- stationary processes. The discretization of a non- stationary 
process is, for example, a more realistic description of the effect of tick size on the price process, 
and therefore it will have more direct applications to the micro structure literature. The discretiza- 
tion of the price process induces round-off errors in the observed returns, and therefore also in the 
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realized volatility. From a financial econometrics perspective, it would be interesting to consider 
also processes with stochastic volatility, i.e. processes where the increments of the process are 
uncorrelated, but the volatility is significantly correlated or is a long memory process. A very 
small scale example of this type of analysis has been performed in La Spada et al. (201 1), where 
discretization of simulated price process with volatility described by an ARCH process has been 
considered, finding qualitative results similar to those presented in this paper. Finally, an interest- 
ing topic for further research is the extension of our results to continuous time process, and then 
the study of the combined effect of round-off error and discrete time sampling. We believe that 
these extensions are potentially of interest in the literature on realized volatility measures, such as 
the realized variance, that are imperfect estimates of actual volatility. 

Since most natural and socio-economic time series are observed on a grid of values, i.e. the 
measured process is naturally discretized, we believe that our results are potentially of interest 
in many contexts. The severe bias of the long memory property of a discretized process should 
warn the scholars investigating processes on the underestimation of the Hurst exponent due to the 
non-linear transformation induced by round-off errors. 
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Appendix A. Distributional properties of the discretized process 

In this appendix we consider the distributional properties of the discretized process of a generic 
weakly stationary Gaussian process. The probability mass function of the discretized process is 



Pd(x) = 2_j q n 5(x - nS) 



(A.1) 



where 



Jrx5(n+l/2) 
p(x)dx 
(5(n-l/ 



(A.2) 



/(5(n-l/2) 

and 6(x) is the Dirac delta function. 

The m-th moment of the discretized process can therefore be written as 



E[x n J] = J] Vn(n6)" 



(A.3) 



Since x is Gaussian distributed, the variance D d of the discretized process can be calculated 
explicitly. From the above expression with m = 2 we obtain 
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(A.4) 



Left panel of Figure [T] shows the ratio D d /D as a function of the scaling parameter £. It is worth 
noting that this ratio is not monotonic. For small £ the ratio goes to zero because 6 is very large 
and essentially all the probability mass falls in the bin centered at zero. In this regime the variance 
ratio goes to zero as 
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When £ » 1 the ratio tends to one because the effect of discretization becomes irrelevant. In this 
regime ZVD- 1 + 1/(120- 

Analogously it is possible to calculate the kurtosis K d = E[x 4 d ]/(E[x d ]) 2 of the discretized 
process. It is direct to show that the kurtosis is 
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For small £ the kurtosis diverges as 



«d - 




1/8? 



(A.7) 



because the fourth moment goes to zero slower than the squared second moment. For large f 
the kurtosis converges as expected to the Gaussian value 3 as kj - 3 - 1/(120 £ 2 ). Note that the 
kurtosis reaches its asymptotic value 3 from below, since it reaches a minimum of roughly 2.982 
at £ - 0.53 and then converges to three from below. 

Trivially, when the discretization is obtained by taking the sign function the variance of the 
discretized process is D d = 1 and the kurtosis is Kd = 1 . 

Appendix B. Proofs for Section [3] 

Proof of Proposition [TJ The discretized process, x d (i), is a non- linear transformation of the un- 
derlying continuous process, x(i). Let us denote the discretization transformation with g(-), so that 



Xd = g(x). From ( 1 1 ) we know that 

ex? 

7d(k) = Y J g 2 j p\k) 



V it = 0,1,... 



7=1 



where p is the autocorrelation function of the underlying continuous process, and gj are defined in 



(12) 



In order to compute the correlation properties of the discretized process we need to compute 
the coefficients gj. Since the function g(x) is an odd function gj = for j even, while all the odd 
coefficients are non-vanishing. Therefore the discretization function has Hermite rank 1 and can 
be written as an infinite sum of Hermite (odd) polynomials. The generic gj coefficient is 

-(«+l/2)/V? 



g j = ^Y j n6 \ Hj(x)e- x2/2 d. 

y2n „trt, J(n-i/2)6 




/2)/V? 



Hj(x)e- X l2 dx 





Figure B.8: Hermite coefficients g 2 { (left) and g 2 (right) of the discretization function as a function of the discretization 
parameter (. The variance of the original process is normalized to 1 . 



The first Hermite polynomial is H\(x) = x and the coefficient gi is 



<D 




(2fc+l) 2 



V^r 



2 (O, e- 1 '^) 



where d- a (u, q) is the elliptic theta function. For large g, g\ =* Vd, while for small £ the coefficient 
g\ goes to zero as 

The second non-vanishing coefficient g 3 is given by 
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B.8 



The functional dependence of g\ and g 2 on £ is shown in Figure 

In principle one could calculate all the coefficients gj. Here we want to focus on the case 
when the correlation coefficient p is small, so that it suffices to consider the first two coefficients 
g\ and g 3 . The reason for this choice is that we are interested in the long memory properties of 
the discretized series which manifest themselves in the asymptotic behavior of the autocorrelation 
function, precisely where p(k) is small. 

Therefore, from (jTT]) we have 



7d(k) = j-7(k) (l + O ( 7 (k) 2 )) for large k . 



If we plug ([[]) and our formula for g\ into (B.l ), we get the result. 



(B.l) 
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Proof of Proposition |2[ From Proposition [T] we know that for large k 
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Under the assumption that L e £(oo, {&,}, {#•}), we can write YZo^ikr^ = £ (l + o(^')) 
and apply Mertens' theorem to get 



J>r^ 



Vi'=0 



2 ^ bfr-jk-^-* = b 2 + o(k-*>) = 0(1) 

i=0 7=0 



By substitution we get the result y d (&) = j iM^i) j ^ £-<* (1 + ( r m¥2« A )| 
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Proof of Corollary [TJ It follows directly from the definition of autocorrelation function and from 
Proposition [T] □ 

Proof of Proposition [3} As the discretization, the sign transformation is an odd function, and 
therefore gj = when j is even. When j is odd the coefficients of the sign function in Hermite 
polynomials are 



-x z /2 



v +1/2 r(j+ 1/2) 



Sj = 2 Hj(x)——dx = 

/o V^r n^(2j+l)l 



By inserting these value in ( 1 1 ) we obtain the autocorrelation (and autocovariance) function of the 
sign of a Gaussian process 



7s(k) = Ps(k) = V g 2 jp(k) J = - arcsinp(fc) 
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For the proof of Proposition [4] we need the following Lemma. Note that the proofs of the 
lemmas are at the end of this Appendix. 

Lemma 1. Under the assumptions of Proposition^ let J a = max{j e N U {0} : (2j + l)a < 1}. 
Then, 



oo oo 



V Yj sl j+ i (k~ a L(k)f i+1 cos(2nka>) = 0(1) for all io. 

k=\ j>J a 

Proof of Proposition |4[ For the sake of simplicity, in the following proof we consider the case of 
an underlying weakly stationary Gaussian process with unit variance, namely D = 1. In order to 
extend the proof to non unit variance we need simply to do the transformation L(k) — > L(k)/D. 
Obviously, L(k)/D is still slowly varying at infinity. 
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From the Wiener-Khinchin theorem we know that 



(p d (iD) = {In) ' 2_j 7d(k) exp 
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Let J Q = maxlj e N U {0} : (2j + \)a < 1}. Obviously, V a 6 (0, 1) 3 J a < +oo. We can write 
Mco) = ^ + |^ £ E && (k- a L(k)) 2j+1 cosCbrfcu) 

k=l j=0 

(B.2) 



+ |" Z Z *t-i (k^mr* 1 cos(2nkco) 



\2;+l 



k=\ j>J a 



The first term in the right hand side of (B.2) is trivially 0(1). From Lemma[T]the last term in 
the right-hand side of (B.2) is also 0(1). As we show below, the second term diverges as to goes 
to zero. 

From Zygmund (1959) (Theorem 2.6 and Theorem 2.15, Chapter V.2) we know that for < 
a < 1, and for any slowly varying function L(k), 

oo 

^r Q 'L(A:)cos(2^M = r(l-a)sinf^)|o;r 1 L(^ 1 ) + o(M Q '- 1 L(a>- 1 )) (B.3) 
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xq, if f(x)/g(x) 



(B.4) 
> 1 as 



as o» goes to zero. Note that we use the notation f(x) ~ g(x) for x 

x — » Xo- 

From Bojanic and Seneta (1973) and Seneta (1976) we know that every power of a slowly 
varying function is still a slowly varying function. Therefore ( |B.3[ ) holds for the first J a terms in 
the asymptotic expansion of the autocovariance of the discretized process. If (2J a + l)a < 1, then 
(B.3 ) holds also for the (J a + l)-th term. If (2J a + \)a = 1, then (B.4) holds. However, as a) goes 
to zero (B.4) is always dominated by (B.3 ). Then, we can write ( |B.2 ) in the following way 



cj) d {co) = g\n~ l T{\ - a) sin(a7r/2)M a_1 £(M _1 ) + o (\a>\ a ~ l L{\coV 1 )) 



By plugging our formula for g\ into (B.5 ) we get the result. 
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Proof of Proposition |5[ For the sake of simplicity, in the following proof we consider the case of 
an underlying weakly stationary Gaussian process with unit variance, namely D = 1. In order to 
extend the proof to non unit variance we need simply to do the transformation L(k) — > L(k)/D. 
Obviously, L(k)/D is still slowly varying at infinity. Moreover, we consider only the case I = oo. 
The case / < oo is trivial. 

From the Wiener-Khinchin theorem we know that 
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If L e £(oo, {bj}, {/?,}), then L(k) = ZT=o bik~ Pi and converges absolutely for large k. Let K > 
s-t. ZSo Ifc.-I*"* < oo and < K~ a £~ fcl*^ < * for a11 k ^ K - 
We split the Fourier transform in the following way 



vM 
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The first two terms are trivially 0(1). The third term diverges as to goes to zero. 

Because £,- bik~^ converges absolutely for all k > K, by Mertens' theorem we can write 
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where the series on the right hand side of (B.6) is the Cauchy product. Note that boj = bj and 
fio,i - fii for all i, while bjfi = b J+ and J3j$ = for all j. By absolute convergence of the original 



series the Cauchy product on the right hand side of (B.6) also converges absolutely for all k > K. 
So, we can write 
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Under our assumption, from Zygmund (1959) (Chapter I, Theorem 2.6) and Theorem 8.3 in 
Rudin (1976) for to > we can exchange the series in ( |B.7[ ) in the following way 

cos(2nkco) = J^ g 2 2j+l ^ bjj Y k H(2M)a+li ' j) cos(27rkto) (B.8) 
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Let J a = max{jfj= NujO): (2j + \)a < 1} and I(j) = {i e N U {0} : (2j + l)a+J3 jti < 1}. 
Obviously J a < oo, 1(f) = for j > J a , and under our assumptions #/(/) < oo for all j^| 



Under our assumptions the series in (B.8) converges absolutely for all to > 0, by the Dini- 
Riemann theorem we can split and rearrange the series in the following way 
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where 77^, = (2j + l)a +p jti . 

6 The symbol #A denotes the cardinality of the set A. 



36 



Lemma 2. The last two terms in (IB. 91) are 0(1). 



Now we focus on the first term in (B.9). First, notice that np < 1 for {0 < j < J a , i € I(j)}. 
From Zygmund (1959) (Chapter 11.13 and Chapter V, Theorem 2.15) we know that for co —* + 
and < ft < 1 



^ k~ fi cos(2nk(o) = T(l -/?) sinOS^/2)|o;r i + 0(1) , 
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Second, in (B.9) we have 2 lo#I(j) < °° terms of this kind, with the only difference that the 
sum over k starts from A; = ZT + 1 instead of k = 1 . Then, we can write 

J a _ J a °° 

Z ^iy+i Z b # Z r "* c ° s ( 2;r ^) = Z && Z ^ Z r "" c ° s o^) + °(!) 
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Now, for sake of simplicity we assume rjjj < 1 for all < j < J a and i e I(j). This simplification is 
not crucial. In fact, if there exist (/, i') s.t. rjfj> = 1, then, according to formulas (B.10) and (B.l 1 ), 
the Fourier transform of these terms will be dominated by the Fourier transform of the other terms 
corresponding to rj^ < 1 as co — » + . From (B.10) we can write 
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where cl are the coefficients given by (B.10) and depend on rjjj = (2j + l)a +fijj. 

The leading term in (B.12) is trivially 0(a) a - 1 ), i.e. when j = and i = 0. If 7(0) = {0} 
(i.e. a +/?i > 1) and J a = U (i.e. 3a > 1), then the sub-leading term is trivially 0(1). On the 
other hand, if #7(0) > 2 (i.e. a + fii < 1) or J a > 1 (i.e. 3a < 1), the sub-leading term is either 
0(a7 tt+/?1_1 ) corresponding to j = and i = 1, or Ofo) 3 " -1 ) corresponding to j = 1 and j = 0. All 
the other terms are either 0(1) or diverge more slowly as co goes to zero, because the coefficients 
Pi are strictly increasing in i. So, the overall sub-leading term will be O (\ C o\ mm ( a+ P 1 ~ 1 > 3a ~ 1 ' n. Then, 
by collecting together all the terms 0(1), for small co we get 

<p d (io) = g\n- l T{\ - a) sm(an/2)bo\aj\ a - 1 (l + O (|<y|»MPi.2«.i-«)^ 

If we want to take into account the case in which some of the exponents r\^ lie on the unit 
circle,we just need to use (B.ll ) instead of ( B.10[ ). If or + /3i = 1, by using ( |B.11[ ) we get 

cp d (co) =g\n- l Y{\ - a) sin(a7r/2)Z> () M Q '- 1 (l + o(\co\ l ~ a In \co\~ l ) + (j(\co\ 2a )) 

Similarly, if 3a = 1, by using ( |B.11| ) we get 

(f, d (co) = gi7r _1 r(l - a)sm(an/2)b \cor 1 (l + o(\(o\ l ~ a \n \co\~ 1 ) + o(\cof [ )) 

If a +fii = 3a = 1, then the error term will simply be O (M 1 ^ In |a»r ! J. 

If we substitute our expression for gi we get the result. □ 

37 



Proof of Corollary |2[ If L is analytic at infinity, then L(k) = 2Z7=o b n k~" for large k for some {b n }, 
and the series converges absolutely inside its radius of convergence. Then, L e X(°o, {£,•}, IAD 
and satisfies the assumptions of Proposition [5J According to our notation fi\ > 1 . By applying 
Proposition p] and our formula for g\ we get the result. □ 

Proof of Proposition |6[ The proof is similar to the proof of Proposition [4j and thus omitted. □ 

Proof of Proposition |7[ The proof is similar to the proof of Proposition [5j and thus omitted. □ 

Proof of Corollary [3} The proof is similar to the proof of Corollary [2} and thus omitted. □ 

Proofs of lemmas for Section^ 

Proof of Lemma [IJ Obviously, V a e (0, 1) 3 J a < +oo. Let 6 > such that (2J a + 3)(a - S) > 1. 
From Bojanic and Seneta (1973) we know that k~ E L(k) — » as k — » oo, for all £ > 0. Then, 
3 if > 1 s.t. k~ 6 L(k) < 1 Vk> K. Sowe can write, 
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The first term in (B. 13 ) is the finite sum of K - 1 terms of the autocovariance of the discretized 



process, and therefore it is trivially 0(1). 



The second term in (B.13 ) can be bounded by a geometric series from above. In fact, from 
TjJ=o 821+1 < °° we know that gj are bounded and go to zero as j goes to 00. Let g = max ;e N g 2 j+i 
and oro = (2J a + 3)(a - 6) > 1 . Then, we can write 
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where L-^J is the largest integer not greater than x. Note that for any integer m>\ there can be at 
most J a + 2 (consecutive) terms s.t. m < i+ J < m + 1. Then, we can write 



do 



E£«M*~f s ^+2)£y>«.y<f^y>-«< 



00 



*=«■ j>7 ff 



fc=£ 7=1 



k=K 



where the last inequality holds because a > L Then, we conclude that the second term in (B.13) 
is 0(1), and therefore we have the result. □ 

Proof of Lemma |H First, notice that rjj ti > 1 for {0 < j < J a , i $ I{j)} and {j > J a , ieNuO). 
Second, note that for j] > 1 
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u v du 
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Let rji = min 77 ,-,• and t] 2 = min rt n . Then, we can write 
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where we used 771,772 > 1, < K~* £, \bi\K~* < 1 and Y™ =Q g\ j+l = 1. 
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Appendix C. Proofs for Section [4] 

For the proofs of Proposition [8] and Theorem [T] we need the following lemma. Note that the 
proofs of the lemmas are at the end of this Appendix. 



Lemma 3. Let m e N, a < 1, an J 

r B 2 ai - 

*(<*) = -a<ar-l)J -^T 

R(a) = - U r 2+a (a(a-l)logt-l + 2a)dt 

Ji 2! 



^d, 



(Cl) 
(C.2) 



where B 2 (x) = 1/6 — x + xr is the third Bernoulli polynomial and {x} represents the fractional part 
of the real number x. Then, R(a) and R(a) converge as m — » 00. 

Note that R(a) and R(a) are the remainders of a first order Euler-Maclaurin expansion of the 
sums 21-=i k a and Y!k=\ k a \ogk, respectively. 

Proof of Proposition |8[ From the proof of Proposition v5\ we know that <p*{a)j) = 1 + CaJj + 
O (<^7 +e ) for some e > 0. So, we can write 
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From Hurvich and Beltrao (1993), 
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where the remainder term is uniform in j. 
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Using the Euler-Maclaurin formula we can write 



£j-iogj = 
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where we used Lemma[3]to establish that the remainders of the Euler-Maclaurin formula are 0(1). 
Using these formulas together with (C.3 1 it follows that £'" =1 a tun = ^= (z^2 m " +1 + ° (m- +1 )) 

J J " V(l+!l) ^ ') 

and X a,jO [(o™ m " +£ ' ~) = 0\\n) m 1°S m )' wnere «i = a + min(a, e). 



i=i 

Since S^ = m(l +o(l)) and since m a[ N~ a ' login = o(m-/N-) by assumption, we conclude 



that 



1 v^ aC(2*r)2 /m\« 



n 



For the proof of Theorem [T] we need the following lemmas. 
Lemma 4. Le? i,k eN, a > 0, an J a £ 1,2. Then, as i —* oo 



j](i - £)F a = A (ar)J 2_a + Ai(ar) » + 0(1) 



A = l 



where A Q (z) 



i 



(z+2)(z+3) 



,!_.„-,_•) andA^z) = - 12 (i-z) + R(z) and R{-) is defined as in {CD. 
Lemma 5. Le? z, A: e N. TTzen, as i — » oo 



k=i 

i 



i\ni+{-—+R l \i + Oa) 



k=i 



-+R 2 \i-lni + 0(l) 



where R\ = R(a = 1) and R 2 = R(a = 2). 

Before proving Theorem [1] we prove the following proposition. 

Proposition 9. Under the assumptions of Theorem^ let us define E,„ = Cov(y(i),y(j)), i.e. the 
covariance matrix of the integrated process (y(l), . . . ,y(m)). Then, 
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(i) if/3 ±2H-l, then 

i 
2H(2H - 1) 



2m = A (e H (i + (r^ 2H -^)) + r(i + ou- miD{2H - Lf3) )) + 



-\i-j\ 2H (i + (\i-jr^ 2H -^))] 

(ii) ifJ3 = 2H-l, then 

*m = 2H( £ H _ X) [?* (1 + Od l - 2H In 0) + f H (1 + 0{t W In 7)) + 
-\i-i\ 2H (l + 0{\i-i\ l - 2H \n\i-j\))] 

Proof. Under the assumptions on x(k), for 1 < i, j, < m we can write 



l J i 

I,„ - Cov(y(i), y(j)) = J] E Cov ( x ^ < 1 ^ = E (/ _ fc) rW + 



fc=l /=1 fc=l 



+ YjU ~ k) y(k) - J](\i - j\ - k) y{k) + min(i, j) r (0) (C.4) 



fc=i 4=1 



where y(0) is the variance of the process x(t). By substituting the explicit functional form for y(k) 
and by the definition of O(-) we get 



V(i - it) y(/c) < A V(i - it) /c 2 "" 2 + M V(i - /c)/t 2 



t 2/Z-2-/3 

fc=i fc=i fe=i 



for some M > sufficiently large. By Lemma 4 we have A X^ =1 (z - k) k 2H 2 = A L' g _ 1} + 0(z')Y 
Now, we consider the following cases: 

(i) /? ^ 2// - 1. In this case we have to distinguish two cases. 
If (3 # 2//, we can use Lemma|4]and obtain 

V (i - it) /c 2 "" 2 ^ = A (2// - 2 - j8) z 2//_/J + Ai (2// - 2 - jS) i + 0( 1) = i ,2H (i~ ™^ m ~ 1 ^ 
k=[ 

If /? = 2H, then we can use Lemma [5] 

J^(i - k) k 2H ~ 2 ~P = l-+R 2 )i-\ni + 0(1) = O (i) 



k=i 



Then, we repeat the same calculation for the second and third term in (C.4). By noting that 
min(z, j) is either of order 0(j) or O(j) and putting together all the terms, we obtain the 
result. 
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(ii) f3 = 2H - 1. In this case we can use Lemma [5] 

£(j - k) k w ^ = Ii In i + l~ + /?! j i + 0(1)) = i M (i 1 -^ In i) 

Then, we repeat the same calculation for the second and third term in ( C.4) . By noting that 
min(i, j) is either of order 0{i) or 0(/) and putting together all the terms, we obtain the 
result. 

□ 
Proof of Theorem [H Following Bardet and Kammoun (2008), 

F\(m) = V } - /W 1} ) V !) - P Ei y (1) ) = Vv !) - y w 'P El y w ) , 

m m 

where E\ is the vector subspace of W" generated by the two vectors e\ = (1, . . . , 1)' and e 2 = 
(1,2,... , m), P El is the matrix of the orthogonal projection on E\, and the second equality holds 
because the projection operator is idempotent. As a consequence, 

E \F\(mj\ = - (Tr(S m ) - Tr(P El X m )) 
L J m 

(i) If /3 ^ 2H - 1, from Proposition [9] we get 

Tr(I m ) = u{ ^ _ - m 2g+1 ( jT* x 2H dx + 0{rn^ H -^) + 0(m~^ 

Note that the error of order 0(m~ l ) comes from approximating the sum with the integral. 
Therefore, we obtain 

Tr(Z m ) = Am2H+1 — (l + 0(mr^ 1H - l H (C.5) 

v H(2H - 1)2//+ 1 v v ' v 

For the term Tr(P £l E„), we use the following representation of the projection operator 

2 



Pf., = 



l(2n + l)-3(i + j) + 6^-\ (C.6) 

V 1 + n) 



n(n — 1) 
Then, using (C.6) and Proposition [9| we can write 

1 ((r, , 1 \ - i + j , , ij 



2Am 2H+l m 2 
Tr(P £l Z„) = 



m(m - l)2H(2H - 1) 

2H , • , 2H 



m m 



yy-v 2 +- -3— +6 

Z_i i—l m l \\ ml m 



, m 2 \\ ml m m(m + 1) 

r=l 7=1 u ' 



(1\ (l +0(r min(2H-l^ + /ZJ ( 1+0(7 -min(2 ff -l^. 



k'-jl 



+ 

1 \mJ v " 7 

2fl 



(i + c ,(i/- J r mm(2// "^ ) )) 
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Approximating sums with integrals we get 

Tr(P El 2 m ) = H A ™ H _ (1 + 0(m-^ 2H -^) + Oinf')) 



f f [(2-3(x + y) + 6xy)(x 2 "+y 2 "H*-yl 2ff )]d;«i;y 
Am 2H+l \+H(4 + H) 



H(2H - 1) (1 + H)(2 + H)(l + 2H) 



(l + 0(m _mil,(2ff - 1 ^ ) )) (C.7) 



Putting together (C.5 ) and (C.7 ) we obtain 



i (Tr(E m ) - Tr(P £l Z m )) = — | _ - f(H)m 2H (l + 0(m- min(2// -^) 

which is the formula of E F 2 (m) I. 
(ii) If /? = 2// - 1, the proof is exactly the same, except for replacing all the terms like 

0( j-mm(2H-\fi)^ w[{h ^ terms Q(j}-2H m ^ 

D 

Proof of Corollary [4} It follows from the autocovariance of a fractional Gaussian noise (see for- 
mula ([3])). The proof is very similar to the proof of Theorem[TJ and thus omitted. However, a com- 
plete proof can be found in Bardet and Kammoun (2008) (see Proof of Property 3.1. therein). □ 

Proof of Corollary |5[ It follows from Proposition [2] and Theorem [TJ □ 

Proof of Corollary [6j It follows from Proposition [3] and Theorem [TJ □ 

Proofs of lemmas for Section^ 

Proof of Lemma |3j First we prove that R(a) converges. Because \B 2 ({1 - t})\ < B 2 (0) = 1/6 for 
all t, we have \R(a)\ < |a( "~ 1 f t~ 2+a dt, where the integral on the right-hand side converges as 
m — » oo because 2 - a > 1. 

Now we prove that R(a) converges. Pick e > s.t. 2 - a - e > 1. One can always find 
such e because a < 1 by assumption. Because log? is slowly varying, there exists T > 1 s.t. 
t~ 2+a log t < r 2+a+E for all t > T. Because \B 2 {{\ - t})\ < 1/6 for all t > 1, we can write 

\R(a)\ < -^ f t~ 2+a (a(a - 1) log t - 1 + 2a) dt + ^±12 f r 2+ff+£ d? 

where the second integral converges asm-x» because 2 - a - s > I. □ 

Proof of Lemma |4j By using Euler-Maclaurin formula up to the first order we obtain 

>p- - ;(^ + ^ + ff<-»)<r- -!) + *<«)) 
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where B 2 = 1/6 is the third Bernoulli number, and R(a) is the remainder of the Euler-Maclaurin 
expansion given by (C.l ). From Lemma [3] we know that R(a) converges, as i —* oo. So, we can 



write 

v^ _ v f~ a i l ~ a ai~ a 

i \ fc-a = + Al(Qr)/ + 

-f-r 1 1 - or 2 12 

K=l 

where Ai(-) is defined as in Lemma[4) Note that the second-order term is 0(i). 
Similarly, we can write 

' ;2-a :\-a /i _A -•— or 

Z, „ z z ( 1 — or) 2 

^" ff = ~ + -z- +A l (a - 1) + J 

2- a 2 12 

Note that in this case the second-order term is 0(j}~ a ). 

Putting together these two terms we have 2[=i( z _ k)k~ a = A (a)i 2 ~ a + A\(a)i + 0(1). where 
A (-) is defined as LemmaWJ Note that the terms of order i l ~ a cancel out exactly. □ 



Proof of Lemma |H The proof is similar to the proof of Lemma |4j and thus omitted. □ 
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